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Despite intense efforts to develop novel and better tools to identify known viruses and to discover new 
viruses, establishing etiological roles for viruses in human disease is challenging. In large part, this may 
be attributed to the high variability of viral species and the difficulties in developing broad-spectrum, 
yet specific, diagnostic assays. To overcome this problem, a novel method for the detection of viruses is 
described in the current manuscript. The technique relies on the addition of synthetic oligonucleotides 


to both termini of RNA fragments in a sequence-dependent manner during first- and second-strand 
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DNA synthesis; these oligonucleotides are used subsequently for amplification of the viral nucleic acids 
of interest. The recognition of the target sequence by the oligonucleotides is mediated by short (6-8 nt) 
conserved regions, which facilitates development of broad-spectrum assays. The method has been tested 
for coronaviruses, although it may be also adopted for other RNA viruses. 


© 2012 Elsevier B.V. All rights reserved. 


1. Introduction 


Viral infections impose a great disease burden on the human 
population. As well as known viral diseases, viral infection is 
thought to play a role in the pathogenesis of many other diseases. 
However, because the detection and identification of viruses is dif- 
ficult, establishing etiological roles for viruses in these diseases is 
challenging. This is particularly true for chronic conditions (e.g., 
Kawasaki disease) where the virus load is very low or the presence 
of the virus at the time of sampling is uncertain (Baker et al., 2006; 
Christen et al., 2012; Simoes et al., 2012). Surprisingly, it may also 
be true for some acute infections (e.g., respiratory tract infections), 
where no etiological agent can be identified in a large proportion 
of cases (Ali et al., 2011; Bezerra et al., 2011; Hayden et al., 2012; 
Mahony et al., 2011; Regamey et al., 2008; Sanghavi et al., 2012; 
Thomazelli et al., 2007). Although the failure to identify an infect- 
ing virus may be due to the presence of a novel pathogen, we have 
shown previously that it can also be due to the imperfect diagnostic 
system (Pyrc et al., 2012). 

Coronaviruses are large, enveloped, single-stranded RNA 
viruses. The genomic nucleic acids (27-32 kb) are shielded by the 
nucleocapsid protein and interact with membrane, envelope and 
spike proteins to form a viable, enveloped viral particle (Fields et al., 
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2007). At present, three major groups of coronaviruses are recog- 
nized (alpha-, beta- and gamma-coronaviruses ), which are clusters 
of species sharing a similar genome structure and other general 
characteristics (International Committee on Taxonomy of Viruses. 
and King, 2012). Generally, alpha- and beta-coronaviruses are con- 
sidered to be mammalian pathogens, while gamma-coronaviruses 
mostly infect birds (Fields et al., 2007). Until 2003, only two human 
coronaviruses were recognized and the research on this group of 
pathogens was relatively limited (Bradburne et al., 1967; Kapikian 
et al., 1969; McIntosh et al., 1967b, 1974; Reed, 1984; Tyrrell and 
Bynoe, 1965). The emergence of the severe acute respiratory syn- 
drome coronavirus (SARS-CoV) in the 2002/2003 winter season 
highlighted the importance of the Coronaviridae family for human 
health (Drosten et al., 2003; Ksiazek et al., 2003; Peiris et al., 2003). 
Realization that there was a threat of animal—human transmission 
of coronaviruses accelerated research into these viruses, and this 
resulted in the identification of two novel human coronaviruses, 
HCoV-NL63 and HCoV-HKU1, which appeared to have resided in 
the human population for many years, undetected by diagnostic 
methods used at the time (van der Hoek et al., 2004; Woo et al., 
2005). 

There are several techniques described for the detection of coro- 
naviruses. The most generic and oldest is electron microscopy, 
which allows visualization of viral particles in an infectious mate- 
rial (Hamre and Procknow, 1966; McIntosh et al., 1967a). Since the 
identification of the first coronaviruses with electron microscopy, 
several other methods have been developed, ranging from cell cul- 
ture to immune detection and molecular detection (Pyrcetal., 2011, 
2012; Schildgen et al., 2006). Recently, more sophisticated methods 
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were developed, including sequence-independent techniques that 
allow amplification of all nucleic acids, regardless of the sequence. 
Such approaches result in the amplification of targets originating 
from the host’s genetic material, as well as targets of viral origin. 
Comparison of non-infected control samples with virus-infected 
samples allowed researchers to select nucleic acid sequence motifs 
present exclusively in the latter set of samples (Allander et al., 2001, 
2005; Anway et al., 2001; Lu et al., 2004; van der Hoek et al., 2004). 
An interesting (and commonly used) diagnostic approach employs 
degenerated universal primers able to detect all coronaviruses that 
share sequence homology at the selected, conserved site; theo- 
retically, this also includes members of the family unrecognized 
previously (Adachi et al., 2004; Drosten et al., 2003; Escutenaire 
et al., 2007; Ksiazek et al., 2003; Moes et al., 2005; Stephensen 
et al., 1999). This approach is promising, although high variability 
of viral sequences coupled with limitations to the level of degener- 
ation of primers may lead to false-negative results if viral species 
are not identical to reference strains. To overcome this problem, 
the CODEHOP technique was developed, allowing the design of 
highly degenerated DNA primers based on more conserved protein 
sequences (Rose et al., 1998; Zlateva et al., 2011). 

In the current manuscript, we describe the development of 
a novel method for the detection of RNA viruses, combining to 
some extent the selectivity of the universal primer approach 
with the broad specificity of sequence-independent methods. The 
method relies on the design of synthetic oligonucleotides that com- 
prise short (6-8 nt) elements that recognize conserved regions of 
viral nucleic acids and longer (16-18 nt) anchor elements. These 
oligonucleotides are then added to termini of RNA fragments dur- 
ing first- and second-strand DNA synthesis. The anchoring regions 
are subsequently used for amplification and analysis of viral nucleic 
acids. The use of short (6-8 nt) conserved regions to recognize 
target sequences facilitates the design and development of broad- 
spectrum assays, and therefore increases the applicability of the 
method. Although the sensitivity of the method is limited, and thus 
most suitable for the detection of cultivable pathogens, its poten- 
tial for recognizing unknown members of the Coronaviridae family 
means that this is a valuable addition to the range of virus detection 
and identification tools that already exist. 


2. Materials and methods 
2.1. Viruses 


Virus stocks were generated by infecting HeLa cells (ATCC: CCL- 
2) with adenovirus; RD cells (ATCC: CCL-136) with enterovirus; 
LLC-MK2 cells (ATCC: CCL-7) with parainfluenza types 1-3, HCoV- 
NL63, human metapneumovirus (hMPV) and human respiratory 
syncytial virus (hRSV); and MDCK cells (ATCC: CCL-2936) with 
influenza A and B. In all cases, cells were lysed by two freeze-thaw 
cycles after the appearance of the cytopathic effect. The virus- 
containing fluid was aliquoted and stored at —80°C. A control cell 
lysate from mock-infected cells was prepared in the same manner 
as the virus stock. 

Parainfluenza 1 and 2 viruses and influenza B virus were isolated 
from clinical samples processed in our laboratory. Stock samples 
containing influenza A virus, enterovirus (human echovirus 9), 
hRSV, human parainfluenza 3 virus and human adenovirus were 
provided kindly by Marcel Muller (University of Bonn Medical 
Center, Germany); HCoV-NL63 was a kind gift from Lia van der 
Hoek (Academic Medical Center, University of Amsterdam, The 
Netherlands). hMPV and human coronavirus HKU1 were provided 
kindly by Oliver Schildgen (University Hospital Witten/Herdecke, 
Cologne, Germany) and Astrid Vabret (University Hospital of Caen, 
France), respectively. 


2.2. Selection of HexaPrime primers 


To identify conserved sites in coronavirus sequences, three 
approaches were used: (1) dedicated HexaPrime software 
(described in Section 3.2: HexaPrime software), (2) Bioedit 
(ver. 7.1.3.0; http://www.mbio.ncsu.edu/bioedit/bioedit.html) 
automatic conserved site finder, and (3) visual inspection. Appli- 
cation of all these approaches allowed appropriate screening 
of coronavirus sequence data. GenBank numbers of sequences 
included in the analysis are as follows: HCoV-NL63 (DQ445911; 
DQ445912; AY567487; NC_005831; JQ765566; JQ765564; 
JQ765568; JQ765570; JQ765572; JQ765574; JQ765565; JQ765563; 
JQ765567; JQ765571; JQ765569; JQ765575; JQ765573), HCoV- 
OC43 (JN129835; JN129834; AY391777; AY585229; AY585228; 
AY903459; AY903460; DQ011855; NC_007732; NC_005147; 
FJ415324), HCoV-HKU1 (HM034837; AY597011; DQ415914; 


DQ415912; DQ415910; DQ415908; DQ415906; DQ415904; 
DQ415902; DQ415900; DQ415898; DQ415896; DQ415913; 
DQ415911; DQ415909; DQ415907; DQ415905; DQ415903; 
DQ415901; DQ415899), HCoV-229E (NC_002645; AF304460), 
SARS-CoV (NC_004718, JX162087, JQ316196, AY559089, 
AY559095, AY613950, AY545914, AP006558, APO006561, 
EU371561, EU371564, AY278554, AY338175, AY357076, 


FJ882945, AY279354, GU553363, AY313906, FJ882930, FJ882936, 
HQ890529, JF292908, HQ890538, JF292915), BCoV (AF391541; 
AF391542; NC_003045; EF424618; EF424616), MHV (JQ173883; 
NC_006852; GU593319; AY700211; AF208066; AF208067; 
AB551247; AF201929; FJ884687; FJ884686; NC_001846; FJ647227; 
FJ647225; FJ647223; FJ647221; FJ647219; FJ647226; Fj647224; 
FJ647222; FJ647218; FJ647220), porcine epidemic diarrhea 
virus (JQ023162; JQ023161; JN825712; JQ282909; JN547228; 
AF353511; DQ811787; NC_003436; EF185992; Z25483; Z24733), 
canine coronavirus (JQ404410, JQ404409, GQ477367), equine 
coronavirus (NC_010327; EF446615), porcine hemagglutinating 
encephalomyelitis virus (NC_007732; DQ011855), feline coro- 
navirus (JQ408980; GQ152141; JN183882; JN183883; FJ938058; 
FJ938060; HQ392471; HQ392469; HQ392472; HQ392470; 
HQ012371; HQO012369; HQ012367; HQO012372; HQO012370; 
HQ012368; GU553361; GU553362; FJ938062; FJ938059; 
FJ938054; FJ938052; FJ938061; FJ938057; FJ938055; FJ938053; 
DQ848678; DQ010921; EU186072; DQ286389; JQ408980; 
AY994055; NC_002306; JQ408981), transmissible gastroen- 
teritis virus (DQ811786; DQ811789; FJ755618; HQ462571; 
EU074218; DQ201447; DQ811788; DQ811785; DQ443743; 
AJ271965). 


2.3. Nucleic acid extraction 


RNA from clinical specimens and viral culture supernatants was 
extracted using a Total RNA mini kit (A&A Biotechnology, Gdy- 
nia, Poland), according to the manufacturer’s protocol. DNA was 
isolated using a Viral DNA/RNA kit (A&A Biotechnology, Gdynia, 
Poland), according to the manufacturer’s instructions. Following 
extraction, samples were incubated for 30 min at 37 °C with DNase 
(DNase Turbo, Thermo Scientific, Vilnius, Lithuania) and the RNA 
was re-isolated. Following isolation, total RNA was precipitated 
in the presence of 30yg of glycogen (Life Technologies, War- 
saw, Poland) with three volumes of isopropanol (16h, —20°C). 
Samples were centrifuged subsequently (12,000 x g, 45 min) and 
the resulting pellets were washed with 70% ethanol, dried (2 min 
at room temperature) and re-suspended in 5 wl of nuclease-free 
water (Sigma-Aldrich, Poznan, Poland). Purified RNA was stored 
at —80°C. 
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Table 1 
Primers used for quantitative PCR. 


Target species Primer 


HCoV-NL63 Sense primer [63NF2]| 
Antisense primer [63NR1] 
Probe [63NP] 

HCoV-HKU1 Sense primer [HKUqPCR5] 


Antisense primer [HKUgPCR3] 
Probe [HKUqPCRP] 


2.4. Virus detection by reverse transcription and quantitative PCR 


Isolated nucleic acids (see Section 2.3) were reverse tran- 
scribed with High Capacity cDNA Reverse Transcription Kit 
(Life Technologies, Warsaw, Poland), according to the manu- 
facturer’s instructions. HCoV-NL63 and HKU1 virus yields were 
determined using real-time PCR with standard curves, as pre- 
viously described (Golda et al., 2011; Pyrc et al., 2010). Briefly, 
five microliters of cDNA was amplified in 10 pl reaction mix- 
ture, containing 1x TaqMan® Universal PCR Master Mix, No 
AmpErase®UNG (Life Technologies, Warsaw, Poland), specific 
probe labeled with FAM (6-carboxyfluorescein), and TAMRA (6- 
carboxytetramethylrhodamine) (200nM) and primers (900nM 
each). All primers and probes are listed in Table 1. Rox was used 
as areference dye. The reaction was monitored on a 7500 fast real- 
time PCR machine (Life Technologies, Warsaw, Poland) with the 
following settings: 2 min 50°C, 10min at 92°C, and 40 cycles of 
15s at 92°C, and 1 min at 60°C. 


2.5. HexaPrime reverse transcription and second-strand synthesis 


Total RNA that was isolated and purified as described in Section 
2.3 was used for the synthesis of the first and second cDNA strands. 
Five microliters of total RNA were mixed with 1.5pM (1.5 pl) of 
the reverse transcription (RT) primer (a list of RT primers used in 
the current study is given in Table 2), incubated at 65°C for 5 min, 
cooled on ice for 2 min, and mixed with the RT MIX (25 U of Multi- 
Scribe Reverse Transcriptase (Life Technologies, Warsaw, Poland), 
1 pl of 10x DNA Polymerase | buffer (Thermo Scientific, Vilnius, 
Lithuania), 0.4 wl of 100 mM dNTPs, 0.2 wl of DMSO ina total volume 
of 3.5 wl). Reactions were carried out for 120 min at 37 °C. Following 
the incubation, samples were heat-inactivated at 85°C for 5 min. 

The resulting single-stranded cDNA was used for second-strand 
synthesis, with no purification step in-between. Briefly, the sample 
(10 wl) was denatured for 1 min at 95°C and cooled on ice. Subse- 
quently, 5 wl of the second-strand mix was added (0.5 U of RNase 
H, 0.5 wl of 10x DNA Polymerase I buffer, 4.5 U of DNA Polymerase I 
(Thermo Scientific, Vilnius, Lithuania), 0.1 wl of DMSO, and 3 pM of 
second-strand (SS) primer). A list of SS primers used in the current 
study is given in Table 2. Samples were incubated for 120 min at 
15°C. Subsequently, double-stranded cDNA was purified by means 
of phenol:chloroform:isoamyl alcohol (pH 8.0) isolation. DNA in 
resulting samples was precipitated with 300 wl of isopropanol 
(16h; —20°C), centrifuged (12,000 x g, 45 min), washed with 70% 
ethanol, dried at room temperature for 2 min, and re-suspended in 
5 wl of nuclease-free water (Sigma-Aldrich, Poznan, Poland). 


2.6. HexaPrime PCR amplification 


The resulting double-stranded DNA was used directly for ampli- 
fication according to the protocol given below. Amplification was 
carried out in a total volume of 20 wl with 1x DreamTaq PCR Mas- 
ter Mix (Thermo Scientific, Vilnius, Lithuania) in the presence of 
1 pM each of forward and reverse primer (RT and SS primers) and 
template DNA (5 pl). The first PCR cycling conditions included ini- 
tial denaturation for 3 min at 95°C, followed by 40 cycles of 10s at 


Primer sequence (5’-3’) 


AAA CCT CGT TGG AAG CGT GT 

CTG TGG AAA ACC TTT GGC ATC 

FAM-ATG TTA TTC AGT GCT TTG GTC CTC GTG AT-TAMRA 
CTG GTA CGA TTT TGC CTC AA 

CAA TCA CGT GGA CCC AAT AAT 

FAM-TTG AAG GCT CAG GAA GGT CTG CIT CTA A-TAMRA 


95°C, 30s at 56°C, 15s at 72°C, and then 5 min at 72 °C for the final 
elongation. 

For the second, nested PCR, the reaction was prepared in a sim- 
ilar manner, with 5 pl of the first PCR mixture as a template and 
12 pM of each primer (a list of the PCR2 primers used in the current 
study is given in Table 2). The cycling conditions included initial 
denaturation for 3 min at 95°C; followed by 13 cycles of 10s at 
95°C, 30s at 68°C (which was decreased by 1°C per cycle), and 
11s at 72°C; a further 30 cycles of 10s at 95°C, 30s at 55°C, and 
11s at 72°C; and then 5 min at 72°C for the final elongation. Fol- 
lowing the reaction, 20 wl of the sample was loaded onto a 1.5% 
agarose gel and visualized with ethidium bromide. 


2.7. Evaluation of HexaPrime assay sensitivity 


To determine the sensitivity of the assay, serial dilutions of the 
virus stocks described in Section 2.1 were used in concentrations 
ranging from 10° to 10° copies per reaction (as determined using 
real-time PCR). RT, SS synthesis, PCR and nested PCR were con- 
ducted as described above (Sections 2.5 and 2.6). The efficiency of 
different SS synthesis enzymes, T7 Polymerase (Thermo Scientific, 
Vilnius, Lithuania), DNA Polymerase I (Thermo Scientific, Vilnius, 
Lithuania), and Sequenase 2.0 (Affymetrix, United Kingdom), was 
evaluated by means of densitometry following bands separation on 
a 1.5% agarose gel. 

To test whether it is possible to detect coronaviral RNA not only 
in cell culture but also in clinical samples, 100 wl aliquots of clini- 
cal specimens, including sputum, bronchoalveolar lavage fluid, and 
nose wash, which had tested negatively for all known pathogens 
(Pyrc et al., 2012), were spiked with 1 wl of HCOV-NL63 virus stock 
(final TCIDs59 of 400). Subsequently, the HexaPrime assay was car- 
ried out and the resulting products were analyzed ona 1.5% agarose 
gel. 


2.8. Evaluation of HexaPrime assay specificity 


To determine whether the coronavirus primers that were cho- 
sen in the current study specifically amplified coronavirus nucleic 
acids, the HexaPrime assay with these primers was also conducted 
on several other human viruses, including HCoV-HKU1, influenza A 
and B viruses, parainfluenza 1-3 viruses, hRSV, human adenovirus, 
human rhinovirus, hMPV and human echovirus 9. Briefly, viral 
nucleic acids were isolated and concentrated as described above 
(Section 2.3). Samples were subjected to the HexaPrime assay using 
primer set 2 (Table 2), as described above (Sections 2.5 and 2.6). 


3. Results 
3.1. Design of HexaPrime primers 


The logic for HexaPrime primer design is similar to the univer- 
sal primer PCR design and therefore the presence of conserved 
sites in the nucleic acids of the viral group of interest is a pre- 
requisite. On the other hand, it also shares some characteristics 
with sequence-independent methods, as anchor oligonucleotides 
are added to flank the region of interest and serve as templates 
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Table 2 


Evaluated primer sets. SS: second-strand synthesis; RT: reverse transcription; 5PCR2 and 3PCR2: 5’ and 3’ primers for the second amplification step (see Fig. 1 for further 


details). 


Primer set Position? Primer 


Sequence Amplicon size? (bp) 


1 16,427-16,550 1_SS 


1_5PCR2 
1_RT 
1_3PCR2 
2 16,413-16,550 2_SS 
2_5PCR2 
2_RT 
2_3PCR2 
3 16,545-16,681 3_SS 
3_5PCR2 
3_RT 
3_3PCR2 
4 19,497-19,579 4 SS 
4_5PCR2 
4 RT 
4_3PCR2 
5 1,949,719,588 5-SS 
5_5PCR2 
5_-RT 
5-3PCR2 
6 19,959-20,032 6_SS 
6_5PCR2 
6_RT 
6_3PCR2 
7 21,507-21,568 7-SS 
7-5PCR2 
7-RT 
7-3PCR2 
8 21,564-21,631 8_SS 
8_5PCR2 
8_RT 
8_3PCR2 


* Position in alignment. 
b The expected amplicon size for HCoV-NL63. 


for subsequent amplification. First, isolated RNA is subjected to RT 
with primers designed using HexaPrime (or other) software. The 3’ 
primers (RT primers) are used for both reverse transcription and for 
the first PCR amplification. They consist of short (6-8 nt) elements 
homologous to a conserved region present in all targeted genomes 
and longer (16-18 nt) artificial anchor sequences that are used as a 
template for PCR amplification (Fig. 1). Following RT, the RNA-DNA 
duplex is disrupted with RNase H and a mixture of DNA Polymerase 
land SS primers (used for both second-strand synthesis and the first 
PCR) is added to the sample. The primers for SS synthesis are con- 
structed in the same manner as the RT primers. Therefore, the SS 
synthesis results in the addition of another anchor (16-18 nt) at 
the 5’ end of the amplified fragment (Fig. 1). The resulting double- 
stranded DNA, flanked with synthetic oligonucleotides anchored 
on both termini, is used for the first PCR amplification with RT and 
SS primers. The product of this amplification is used subsequently 
for the nested amplification with PCR2 primers that amplify a target 
within the template DNA. This amplification results in the gener- 
ation of short amplicons, which can be visualized on a standard 
agarose gel and sequenced. The general overview of the HexaPrime 
asSay is presented in Fig. 1. 

To design appropriate primers for the detection of coron- 
aviruses, a set of coronaviral sequences was obtained from the 
GenBank database (accession numbers are provided in Section 2). 
Sequences were aligned and inspected for the presence of short, 
conserved regions. Inspection was performed with three indepen- 
dent approaches: 


(1) HexaPrime software. This Java-based software for the 
identification of conserved sequences was developed for 


AGCAAGATCCAATCTAGASTGATG 155 
CAAGATCCAATCTAGASTGATGA 
CCAAGGGATTCCCAACCTYCCAAC 
AAGGGATTCCCAACCTYCCAACA 
GCAAGATCCAATCTAGAATGATSA 169 
AAGATCCAATCTAGAATGATSATT 
CCAAGGGATTCCCAACCTYCCAAC 
AAGGGATTCCCAACCTYCCAACA 
AGCAAGATCCAATCTAGARTGTTG 168 
CAAGATCCAATCTAGARTGTTGG 
AGGGATTCCCAACCTTCATCNACA 
GGATTCCCAACCTTCATCNACAAA 
AGCAAGATCCAATCTAGACARTGG 114 
CAAGATCCAATCTAGACARTGGGG 
CCAAGGGATTCCCAACCTYNGCAT 
AAGGGATTCCCAACCTYNGCATC 
AGCAAGATCCAATCTAGACARTGG 123 
CAAGATCCAATCTAGACARTGGGG 
CCAAGGGATTCCCAACCTCKWGTCA 
AAGGGATTCCCAACCTCKWGTCAT 
AGCAAGATCCAATCTAGATTTGAYA 105 
CAAGATCCAATCTAGATTTGAYAC 
CCAAGGGATTCCCAACCTAANGCA 
AAGGGATTCCCAACCTAANGCATG 
AGCAAGATCCAATCTAGANNATGC 93 
CAAGATCCAATCTAGANNATGCC 
CCAAGGGATTCCCAACCTCCATART 
AAGGGATTCCCAACCTCCATARTT 
AGCAAGATCCAATCTAGAAAYTAT 99 
CAAGATCCAATCTAGAAAYTATGG 
CCAAGGGATTCCCAACCTTAYTGA 
AAGGGATTCCCAACCTTAYTGACA 


the current study. The software is available publicly at 
http://www.virogenetics.info (for more details see Section 3.2: 
HexaPrime software). 

(2) Bioedit. Conserved sequences were identified based on the 
automatic conserved site finder feature of this software. 

(3) All sequences were inspected by eye and conserved sequences 
were marked. 


This analysis led to the identification of several conserved 
regions (6-8 nt) and allowed the design of multiple primer sets, 
which are listed in Table 2. All fragments were selected based on 
sequence identity alone without any pre-selection based on the 
location or predicted RNA structure. 


3.2. HexaPrime software 


The HexaPrime software for the identification of conserved sites 
in multiple viral genomes was prepared in the Java environment. 
Before the HexaPrime analysis, the collection of all sequences of 
interest in the FASTA format is prepared. During analysis each 
sequence is divided into overlapping words of desired length. Sub- 
sequently, for each sequence these words are converted into a bit 
array. Arrays corresponding to each sequence are then multiplied 
logically and the resulting array contains only words present in 
all tested sequences. This approach appeared to be more effec- 
tive than simple comparison of sequences and it provides complete 
information on the presence of a certain nucleotide stretch in the 
whole genome sequence, which may be of importance during in 
silico screening of potential primer pairs. The resulting list of words 
does not contain information on the position of these words in 
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Viral RNA 


ae transcription 
he 


RNA degradation (RNaseH) 
second strand synthesis (Polymerase I) 


viral RNA / CDNA hybrid 


Double-stranded cDNA 


ia} 
/\ °° 
1st & 2nd PCR (Taq polymerase) 


Amplification 


Fig. 1. The HexaPrime assay. Briefly, reverse transcription (RT) and second-strand 
(SS) synthesis reactions were conducted on total RNA (black) in the presence of 
primers comprising short (6-8 nt) elements that recognize conserved regions of 
viral nucleic acids (dark green and dark blue for RT and SS primers, respectively) and 
longer (16-18 nt) anchors that serve as a template for subsequent PCR amplification 
(light green and light blue for RT and SS primers, respectively). Products of the PCR 
amplification of coronaviral RNA may be analyzed further by gel electrophoresis and 
sequencing. 


each genome; therefore, these are recovered subsequently from 
sequence data and the distance between each word pair is deter- 
mined. Results are presented as a list of words with distances 
between them and their position in the original sequences of each 
genome. The overview of the method is presented in Fig. 2. 


3.3. Evaluation of HexaPrime primer sets 


To validate the designed primer sets, all were tested on samples 
containing HCoV-NL63 RNA. All steps of the procedure were eval- 
uated and optimized carefully to determine the optimal reaction 
conditions. Briefly, different enzymes and buffers, concentrations 
of salts and primers, and thermal profiles for amplification were 
tested. These results are not described in the manuscript due to 
Space constraints, but the final optimal conditions are presented. 

RNA was isolated from virus stocks and concentrated by pre- 
cipitation in the presence of glycogen. The concentration of the 
RNA samples resulted in a proportional increase in virus yield, as 
determined by real-time PCR (data not shown). Subsequent anal- 
ysis showed that this step also enhanced the sensitivity of the 
HexaPrime assay (data not shown). Isolated and concentrated RNA 
was processed according to the HexaPrime method as described 
in Section 2. Briefly, RNA was reverse transcribed using the Multi- 
Scribe Reverse Transcriptase and the RT primers listed in Table 2. 
Second-strand synthesis was conducted using DNA Polymerase I 
and the SS primers listed in Table 2. The first and second PCR 
amplification was conducted with RT/SS primers and PCR2 primers, 
respectively. Gel electrophoresis of the resulting products yielded 
several bands, although only amplification with primer sets 2, 
4, 5 and 8 yielded distinct bands in samples containing viral 
RNA (Fig. 3A). DNA fragments of the appropriate size were then 
excised from the gel, isolated, and cloned into the pTZ57R plasmid. 
Clones were sequenced and their identity was confirmed. Based on 
assay performance (densitometry) and sequencing results, primer 
set 2 was used for subsequent analyses with alpha- and beta- 
coronaviruses (Fig. 3B). 
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Fig. 2. The HexaPrime software. The general algorithm for the software is presented. 
(A) Sequences are divided into six nucleotide-long words. (B) Subsequently, these 
words are converted into a numerical format and a bit array is formed. Arrays cre- 
ated for each sequence are further logically multiplied and the resulting array is 
converted back to words. (C) The resulting word list contains information on the 
genomic position and distance between words in each of the genomes. 


3.4. Sensitivity of HexaPrime assay 


To determine the sensitivity of the HexaPrime assay, the con- 
centration of viral RNA in cultured samples was determined using 
real-time PCR. Following this, the RNA concentration was set to 
10° copies ml—! and serial, 10-fold dilutions to 10° copies ml"! 
were prepared. These samples were subjected to the HexaPrime 
assay, and the results presented in Fig. 3C show that the assay is 
sensitive to 10° copies ml~!. To test whether the performance of 
the vaguest step of the reaction (i.e., SS synthesis) is optimal, an 
effort was made to evaluate other enzymes able to carry out the 
SS synthesis. As clearly visible in Fig. 3C, DNA Polymerase I and 
Polymerase T7 showed similar efficiency, while SS synthesis with 
Sequenase 2.0 did not result in discrete bands. 


3.5. Cross-reactivity of HexaPrime primers 


To determine whether the HexaPrime assay is specific toward 
coronaviruses, primer set 2 for coronaviruses was tested for 
cross-reactivity with the viral nucleic acids of other respiratory 
pathogens. Briefly, RNA or DNA of hMPV, adenovirus, rhinovirus, 
enterovirus, influenza A and B viruses, parainfluenza 1-3 viruses 
and hRSV was isolated from cell culture and subjected to the 
HexaPrime assay without dilution. Even though some background 
amplification was observed for some samples, no cross-reactivity 
(i.e., no products of similar size or quantity) was detected in any 
of the assays, showing that the HexaPrime assay is highly specific 
(Fig. 4). No amplification was observed in negative control samples 
(mock-infected cell culture) or water. 
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Fig. 3. Evaluation of the HexaPrime assay. (A) Evaluation of different primer pairs for the detection of coronaviruses. Analysis was conducted using the HCoV-NL63 virus and 
all primer sets given in Table 2 were tested. Only amplification with primer sets 2, 4, 5 and 8 yielded distinct bands. Sequencing of products and analysis of fragment size 
revealed that only primer set 2 allowed efficient amplification of the desired product. M: size marker; mock-infected (—) or HCoOV-NL63-infected (+) cell culture supernatant. 
(B) Detection of HCOV-NL63 and HCoV-HKU1 with the HexaPrime assay using primer set 2. All experimental procedures were conducted as described in Section 2. M: size 
marker; W: water; NL63 and HKU1: mock-infected (—) or virus-infected (+) cell culture supernatant. (C) Sensitivity of the HexaPrime assay. Concentrated samples containing 
viral RNA (10° copies ml~') were subjected to 10-fold serial dilutions in cell culture supernatant and the HexaPrime assay was conducted. For each RNA concentration, three 
different enzymes for SS DNA synthesis were trialed. A, B and C denote DNA Polymerase I, T7 Polymerase, and Sequenase 2.0, respectively. 


3.6. HexaPrime assay performance in clinical samples 


The HexaPrime assay is designed to detect viral RNA in complex 
clinical specimens. Therefore, assay performance was evaluated 
using different sample types, including nose wash, bronchoalveolar 
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lavage fluid and sputum, spiked with virus. Briefly, 0.5 pl of cul- 
ture supernatant from LLC-MK2 cells infected with HCoV-NL63 was 
inoculated into 100 wl of each clinical sample type (all of which had 
previously tested negative for coronaviruses). Reactions were per- 
formed according to the protocol described above and the resulting 
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Fig.4. Specificity of the HexaPrime assay. The HexaPrime assay was conducted on samples containing different human respiratory viruses, including human metapneumovirus 
(hMPV), adenovirus (AdV), rhinovirus (RV), enterovirus (EV), influenza A and B viruses (IAV and IBV, respectively); parainfluenza 1, 2 and 3 viruses (P1, P2 and P3, respectively); 


and respiratory syncytial virus (RSV). M: size marker. 
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Fig. 5. Detection of HCoV-NL63 in different types of clinical samples. Different sam- 
ple types (sputum, bronchoalveolar fluid (BALF) and nose wash) were spiked with 
HCoV-NL63. No inhibition of HexaPrime reactions and no additional bands were 
observed. W: water; — and + signs: samples not inoculated or inoculated with 
HCoV-NL63. 


products were analyzed by standard gel electrophoresis. No inhi- 
bition of the reaction was observed for any of the clinical samples 
tested (Fig. 5). 


4. Discussion 


Numerous methods have been developed for the detection of 
viral pathogens, and the universal primer approach, for all its limi- 
tations, is undoubtedly the most sensitive method for semi-generic 
detection (Moes et al., 2005; Pyrc et al., 2012). On the other hand, 
it is only appropriate for identifying viruses that are closely related 
to known viruses. For the detection of novel viruses that are not 
detected using universal primers, several sequence-independent 
techniques have been developed. These include random-priming 
PCR, representational difference analysis (RDA), and virus discovery 
based on cDNA amplified fragment length polymorphism (VIDISCA) 
(Adachi et al., 2004; Allander et al., 2001, 2005; Anway et al., 2001; 
Drosten et al., 2003; Escutenaire et al., 2007; Ksiazek et al., 2003; Lu 
et al., 2004; Moes et al., 2005; Stephensen et al., 1999; van der Hoek 
et al., 2004). These methods are named sequence-independent as 
the amplification is not driven by specific primers, but instead a ran- 
dom (or semi-random) amplification is carried out. For example, a 
random-priming PCR method utilizes primers containing random 
nucleotides at their 5’ termini, which are able to attach to and 
amplify targets, including viral genetic material, indiscriminately 
(Fouchier et al., 2004). A more specific approach is employed by 
RDA, where primers selected arbitrarily are used and, following 
the PCR reaction, infected samples are compared to those of non- 
infected samples (Liang and Pardee, 1992; Lu et al., 2004). A very 
different approach is employed in the VIDISCA and SISPA methods. 
The whole amplification process is based on the presence of com- 
mon restriction sites in RNA/DNA molecules. Following enzymatic 
restriction, these restriction sites serve as sites for ligation of syn- 
thetic DNA fragments which, in turn, constitute primer attachment 
sites. In this way, every fragment flanked with certain restriction 
sites will be amplified. Comparison of the characteristic amplifica- 
tion pattern of infected and control samples may serve to identify 
nucleic acids of viral origin. Unfortunately, all these approaches are 
extremely laborious and high-throughput analysis is not possible 
(Allander et al., 2001; Reyes and Kim, 1991; van der Hoek et al., 
2004). 

In the current manuscript, a novel assay for the detection of 
coronaviruses is presented, the HexaPrime assay. Although the sen- 
sitivity of the assay with the primer sets tested in the current work 
is indisputably lower than that described for universal primers, 
it is similar to that described for sequence-independent methods. 


Because a shorter conserved nucleotide stretch is required for suc- 
cessful priming, the major advantage of the HexaPrime assay is that 
it provides broader specificity when compared to the standard uni- 
versal primer approach. It is also worth noting that methods for 
sequence-independent detection of viral pathogens (e.g., VIDISCA 
or RDA) require highly-trained personnel and appropriate infra- 
structure, while the HexaPrime assay is relatively simple. The only 
additional step, compared to a typical nested PCR, is the ligation of 
anchors to 5’ and 3’ ends of the RNA fragments, which is a single- 
tube reaction. 

To increase the sensitivity of the method, an additional step was 
added to the protocol that includes precipitation of RNA in the pres- 
ence of glycogen. This concentration method was confirmed to be 
effective by quantitative PCR (data not shown). The addition of such 
a step may therefore also be beneficial for other virus detection 
methods, such as VIDISCA or SISPA (Allander et al., 2001; van der 
Hoek et al., 2004). 

The analyses performed in the current study show clearly that 
the HexaPrime assay is highly specific for coronaviral species and 
it may therefore be useful for routine screening of clinical samples. 
Due to the limited sensitivity of the method, the in vitro culture 
of pathogens may be a prerequisite for successful amplification. 
To this end, fully-differentiated airway epithelium cultures may 
be a useful pre-amplification tool to increase the titers of airway 
viruses (Pyrc et al., 2012). Obviously, due to technical limitations, 
this method may be used mostly in research facilities focused on 
discovery of new pathogens, rather than in clinical virology labo- 
ratory settings. The method has been tested for the detection of 
viruses that belong to a single viral family thus far, although it may 
also be adopted for other RNA viruses. 
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